System and method for providing an omnimedia package

ABSTRACT

Disclosed is a system that permits a variable number of disparate streams of data to be packaged together as content into a single distinct program referred to as an omnimedia package. A framework definition may be specified and created for the omnimedia package to allow a client set top box to decode the information and provide access to streams comprising video, audio and metadata information. The framework definition may be employed by a broadcast system to create a broadcast stream. A mechanism is described that permits a content provider to define a framework for delivering a package of related content. The framework definition encapsulates information necessary to build, format, transmit and display the disparate content streams. Data may be downloaded to a receiver prior to the broadcast of an associated program. The present invention may be implemented on terrestrial, cable, satellite, VDSL and other transport systems, including those that support upstream communication.

CROSS REFERENCE TO RELATED APPLICATIONS

[0001] This application claims the benefit of U.S. patent application number 60/253,168 entitled 'OmniMedia Package”, filed Nov. 27, 2000 by Steven Reynolds, which is also specifically incorporated herein by reference for all that it discloses and teaches.

BACKGROUND OF THE INVENTION

[0002] a. Field of the Invention

[0003] The present invention relates to enhanced multimedia television and more particularly to a system and method for organization, combination, transmission and reception of media from a range of sources wherein the media may comprise a plurality of video streams, audio streams, and other information, such as may be accessed via the Internet.

[0004] b. Description of the Background

[0005] The format of television programs often conforms to NTSC, PAL, or SECAM standards wherein a predefined bandwidth, i.e. channel, is employed to carry a single television program. Additional information that may be provided with a program has often been encoded into the vertical blanking interval (VBI), such as closed captioning or alternate language support, for example. As television broadcast formats move to digital transmission, many programs continue to be presented in a manner similar to ‘channel’ based television, comprising one video stream, a primary audio stream, and possibly an alternate audio stream.

[0006] Continued expansion of the Internet and high bandwidth networks provides access to an increasing volume of information. Adoption of digital transmission formats allows media such as audio, video, and metadata content, to be associated, combined, and presented to provide viewers with a richer and more diverse media experience. Methods such as MPEG7 provide for relating content information, but do not provide a method by which content of various formats may be grouped, transmitted and displayed. Therefore a new method of organizing, transmitting, and presenting media from multiple sources is needed.

SUMMARY OF THE INVENTION

[0007] The present invention overcomes the disadvantages and limitations of the prior art by providing a system and method that allows a transmission system to organize and transmit a related set of media and for a display platform to organize and render related media information in a manner that reflects the available media and the capabilities of the platform. A framework definition identifies a set of associated content (media) for a broadcast program. The present invention compares the format of the media with a transmission format and converts media of other formats to that of the transmission format. An omnimenu describes the content. Media content and the omnimenu are combined into a broadcast stream and transmitted.

[0008] The present invention may therefore comprise a method for producing a broadcast stream that contains audio content, video content, and metadata content comprising: creating a framework definition that identifies the audio content, the video content and the metadata content associated with a broadcast and attributes thereof, comparing the audio format of the audio content with an audio transmission format and converting the audio content to the audio transmission format if the audio format and the audio transmission format differ, comparing the video format of the video content with a video transmission format and converting the video content to the video transmission format if the video format and the video transmission format differ, comparing the metadata format of the metadata content with a metadata transmission format and converting the metadata content to the metadata transmission format if the metadata format and the metadata transmission format differ, creating a menu describing the audio content, the video content, and the metadata content, combining the audio content, the video content, and the metadata content into a broadcast stream, transmitting the menu; and transmitting the broadcast stream.

[0009] A framework controller may utilize the framework definition to access media content, to process and format the media content and to control packaging and multiplexing of the content for broadcast.

[0010] The present invention may further comprise a system for combining multiple media and metadata streams having content into a framework for distribution of the content to a viewer comprising: at least one video source having an output, at least one audio source having an output, at least one metadata source having an output, a framework controller that receives the video source, audio source, and metadata source and produces an omnimedia package integrating the outputs into a framework, a framework definition module that interfaces with the framework controller and defines all content to be used in the omnimedia package, a delivery module that receives the omnimedia package from the framework controller and transmits the omnimedia package to a receiver, and a receiver that receives and distributes the content of the omnimedia package to display devices and audio outputs, the receiver further coupled to at least one user input device that provides interactivity between the viewer and the receiver.

[0011] The present invention may utilize pre-loaded content that is transferred to a receiver prior to the broadcast of a media stream with which the preloaded content is associated. Pre-loaded content allows voluminous and complex content to be employed during a broadcast without requiring bandwidth to transfer the pre-loaded content at the time of broadcast and without latencies that may be incurred if the pre-loaded content were transferred at the time of broadcast.

[0012] The present invention may additionally comprise a method for rendering portions of a broadcast stream that contains audio content, video content, and metadata content and a menu indicating the contents of the audio content, video content, and metadata content comprising: transferring preloaded metadata associated with the broadcast stream to a receiver prior to transmission of the broadcast stream, receiving the broadcast stream, displaying the menu wherein the menu includes an icon representing the preloaded metadata, receiving a user input, and rendering the preloaded metadata in response to the user input.

[0013] Advantageously, the present invention allows a viewer to select among a plurality of audio, video and metadata sources to obtain a television presentation tailored to the viewer's preferences, offering increased viewing satisfaction. The present invention may be employed utilizing the capabilities of emerging digital transmission formats to provide an enhanced viewing experience and to increase audience size, allowing increased advertising revenue.

DESCRIPTION OF THE FIGURES

[0014] In the figures,

[0015]FIG. 1 is a high-level block diagram of the present invention.

[0016]FIG. 2 depicts components of framework controller 108.

[0017]FIG. 3 depicts a receiver that may be employed with the present invention.

[0018]FIG. 4 is flow chart 400 of the operation of the metadata processor 214.

[0019]FIG. 5 is a flow chart 500 illustrating the operation of omnimenu generator 224.

DETAILED DESCRIPTION OF THE INVENTION

[0020] The present invention is directed to allowing the creation of larger, more robust productions with support for interactivity and multiple video, audio, and data streams. A framework definition provides organization of media for transmission, and for rendering of media by a display platform that may comprise a television, interactive television, set-top box, satellite receiver, personal computer or other operable to receive data across a network or airwave and process data according to the method of the present invention. The framework definition allows a media stream or multiple media streams to be packaged together with other content into a single distinct program, hereinafter referred to as an omnimedia package. A content provider may employ the framework definition to specify and deliver a package of related content, encapsulating the information necessary to build, format, transmit and display the content. Content may comprise video, audio, and data information that may be streamed or cached. A wide range of information types and formats may be employed as illustrated by table 1. The present invention is not limited to any specific types of information, formats, or relationships between information. TABLE 1 Media Type Format CBR and VBR encoded video streams MPEG, DVB, etc Encoded Audio Streams MP2, MP3 Packet-based encoded video/audio streams QuickTime, AVI, etc Program Synchronous Metadata ATVEF, Java, etc Scripted content HTML, Java, JavaScript, ECMAscript, VBScript, etc Interactive Graphical User Interface HTML, CSS 1, DOM 0 Graphical Elements Logos, still-frame images, JPEG, GIF, etc

[0021] The present invention may be employed with broadcast systems that utilize terrestrial, cable, satellite, VDSL, or other transport methods. The present invention may also be employed with systems in which content is requested via a reverse path and with systems that support local or client side storage of pre-delivered content. Digital format transmission systems are well suited to the method of the present invention, but analog systems with VBI data and Internet connection may also be employed.

[0022]FIG. 1 is a high-level block diagram of the present invention. A plurality of audio/visual and metadata sources, which provide the audio, video, metadata and other data services, are combined to produce an omnimedia package. The term omnimedia describes the inclusive nature of the content being delivered by the present invention, as contrasted with the usual audio/video packaging associated with television events. Thus, for a given event, the omnimedia package represents all of the synchronous and asynchronous components from video source(s) 102, audio source(s) 104, and metadata sources 106, which may comprise and may be associated with a primary broadcast. Video source(s) 102 may comprise standard television broadcast channels (analog or digital modulation and encoding), any form of analog video (for example, Beta and VHS format tapes), any form of stored digital video (for example, Laserdisc, DVD or any server-based digital video, such as may be used by video-on-demand systems) or may be any form of packet-based streaming media. This may include alternate camera sources, supporting video, alternate advertising video, localized or regionalized video. Audio source(s) 104 may comprise soundtracks that accompany one or more video source(s) 102, or may comprise an independent stream such as a radio signal or other audio only broadcast. Framework definition 110 defines the relationships between different content streams. For video source(s) 102 there may be zero, one or several associated audio source(s) 104. Further, a given audio source 104 may be assigned to zero, one or more video source(s) 102. There is no requirement that any given video source 102 have an accompanying audio source 104. Audio may be encoded with a video source, such as may be practiced with packet-based streaming media. Metadata source(s) 106 may contain data packaged with respect to an event or associated with a general subject and may include executable code, scripts, graphics, logos, or general data. Metadata may be time synchronized, content related, or ancillary to an event.

[0023] Following are three examples illustrating the types and relationships between video source(s) 102, audio source(s) 104, and metadata source(s) 106, and how these sources may be organized and associated to provide a richer media presentation. In Example One, an omnimedia program is provided for the French Open, a tennis tournament featuring several simultaneous events. The primary broadcast may be from the main (or center) court with accompanying broadcasts from other venues.

Example One

[0024] Source ID Source Type Source Description 102.1.A High bit rate video Video feed from center court, stream intended for featuring players A and B. This is the full screen video. Primary Video to be broadcast. This is the video that is broadcast to non-omnimedia enabled television systems. 104.1.A Audio Track This is the English audio track for 102.1.A. 104.1.B Audio Track This is the French audio track for 102.1.A. 102.1.B Low bit rate video Camera focused on player A's coach stream intended for and family members. scaled video window. 102.1.C Low bit rate video Camera focused on player B's coach stream intended for and family members. scaled video window. 102.1.D Medium bit rate video Alternate camera angle 1 from behind stream. player A. 102.1.E Medium bit rate video Alternate camera angle 2 from behind stream. player B. 102.1.F Medium bit rate video Video feed from court 2, featuring stream. players C and D. 104.1.C Audio Track This is the English audio track for 102.1.F. 104.1.D Audio Track This is the French audio track for 102.1.F. 102.1.G Packet-based video Video and audio feed from court 5, stream with embedded featuring a mixed doubles match audio track. between two poorly seeded pairs. 106.1.A HTML Document Interactive page that shows statistics for match in progress, tightly synchronized to 102.1.A. (video stream). 106.1.B HTML Document Interactive page that shows summary of all matches in progress, loosely synchronized to 102.1.A, 102.1.B and 102.1.G (video streams). 106.1.C HTML Document Interactive page that shows results of today's matches and contains links to upcoming matches (links navigate to 106.1.D and 106.1.E). 106.1.D HTML Document Interactive page that provides detailed commentary for an earlier match 106.1.E HTML Document Interactive page that shows statistics and commentary for upcoming match 106.1.F JPEG Image Logo image to be used by Interactive pages (106.1.A-106.1.E).

[0025] Example Two shows the components of an omnimedia package relating to the fourth day of the Masters, a major golf tournament.

Example Two

[0026] Source ID Source Type Source Description 102.2.A High bit rate video Video feed following, as appropriate stream intended for and opportune, the various leaders. full screen video This is the Primary Video to be broadcast. This is the video that is broadcast to non-omnimedia enabled television systems. 104.2.A Audio Track This is the audio track for 102.2.A. 102.2.B High bit rate video Video feed following Tiger Woods. stream intended for full screen video 104.2.B Audio Track. This is the audio track for 102.2.B. 106.2.A HTML Document Interactive page showing Tiger Woods score card for the round in progress 106.2.B HTML Document Interactive page showing leader board for tournament 106.2.C HTML Document Interactive page showing past tournament winners 106.2.D Java Code Executable code that instructs consumer device to show an animated view of the current hole in play on 102.2.A (loosely synchronized to 102.2.A)

[0027] Example Three shows the components of an omnimedia package relating to a retransmission of the H. G. Wells “War of the Worlds” radio broadcast.

Example Three

[0028] Source ID Source Type Source Description 104.3.A Audio Track Audio feed of the original broadcast. This is the Primary Audio to be broadcast. This is the video that is broadcast to non-omnimedia enabled audio systems such as an automobile's radio. 104.3.B High Quality This is an audio track that has been Audio Track. redubbed and digitally remastered with added stereo sound effects. 102.3.A High bit rate video Video feed of an interview with stream intended for H. G. Wells about the making of and full screen video hysteria generated by the original broadcast. 104.3.C Audio Track. This is the audio track for 102.3.A. 106.3.A HTML Document Interactive page showing program synchronous factoids about the original broadcast of “War of the World”. Tightly synchronized to 104.3.A. 106.3.B JPEG Image Program synchronous images showing photos, artwork and press clippings related to 104.3.A.

[0029] Framework definition 110 describes all content used in an omnimedia package. Framework controller 108 employs framework definition 110 to create an omnimedia package. Framework definition 110 defines the primary video and audio to be used by non-omnimedia aware platforms, plus initial audio and video for omnimedia package streams. Framework definition 110 is also employed to generate a main-menu/omnimenu. An omnimenu is an interactive user interface that presents a menu of options available with an omnimedia package. Options may comprise different video streams, different television layout, different audio sources, and other interactive data streams. The definition of each stream may include a start/stop time to enable framework controller 108 to switch streams during when creating the omnimedia package. An omnimenu may be presented in a range of formats and may be customized by the system operator. For example, one embodiment of an omnimenu may scale current video into an upper right-hand corner and display a list of options to the left of alternate video sources, alternate audio sources, and links to other data. Another embodiment of an omnimenu may employ a pre-defined TV layout comprising a ¾ screen primary video format and several secondary thumbnail videos and data indicators across the bottom of the screen.

[0030] Framework definition 110 may be created at a production facility or other site and may employ automated and manual methods of associating media. The framework definition 110 may reflect limitations and constraints of source delivery systems, transmission systems, and display platform limitations. Such limitations may include available bandwidth, number of channels tuned by a display platform, cache size, supported formats or other parameters. A framework definition may specify that images must be in GIF format and that HTML code must support version 3.23, for example. For each event or presentation, a framework definition record may be produced that may comprise information defining the type, size, nature, pricing, scheduling, and resource requirements for each media offered. Table 2 lists a number of components that may comprise a framework definition record. TABLE 2 Framework Definition Record Components Version of the omnimedia package Onmimenu associated with the omnimedia package, may be created by framework controller. Parameters used by framework controller 108 to allocate, control and prioritize transmission of stream elements. Flags/Category, languages, feature set, including multiple ratings available (e.g. PG thru R) Parameters employed to control delivery. (Tuning, URL, etc...) Price, if any: An associated cost for the omnimedia package Time the omnimedia package expires The number of framework definition stream records A plurality of stream records

[0031] A stream record is provided for each media source (audio, video, or metadata) to be offered in the omnimenu. Each framework definition stream record may comprise:

[0032] a unique ID;

[0033] a primary Y/N selection indicator;

[0034] the media Type (audio/video/data/other);

[0035] the bandwidth required for the media;

[0036] a start time/date time;

[0037] the date and time the omnimedia package expires;

[0038] a description having a paragraph or two describing the data stream (by way of example, an audio stream could include the person speaking/band playing, a video stream could include the camera angle, or location, and a data stream could describe the interactive content and what it does. These are used by the framework controller 108 when building the omnimenu;

[0039] a location (framework controller 108 would fill this out most likely) PID;

[0040] flags category, languages, feature set, including multiple ratings available (e.g. PG thru R);

[0041] price, including an associated cost for the stream as well as an additional fee beyond the cost of the program such that to see a certain rating version of a show or to see video instead of just hearing audio may require a premium payment;

[0042] a unique Group ID within an Event that identifies a group of streams that are used by the framework controller 108 when building the omnimenu (note that only one stream from the group can be used at one time); and

[0043] any Dependent Streams, each having a unique stream ID which is required to access/use the stream. In use, a stream may require another stream. These stream IDs may be used by the set top box to determine if the stream is available, i.e., an audio stream may be dependent on the video stream so that only devices that can view/use the video stream can access the audio stream. Also, the stream IDs can be used by head end equipment that is trying to optimize bandwidth and may want to separate different streams on different transponders for the same event.

[0044] An example of a framework definition is shown below: <XML> <Version>4</Version> <Menu>Showme.html</Menu> <Flags> <Language>English</Language> <Rating>PG-13</Rating> </Flags> <Control> ???????? </Control> <Price>2.99</Price> <Expire>02/17/00 21:00</Expire> <StreamRecords>1</StreamRecords> <StreamRecord1> <ID>459812</ID> <Primary>True</Primary> <MediaType>Video</MediaType> <Bandwidth>256</Bandwidth> <Start>02/15/00 22:00</Start> <Expire>02/17/00 21:00</Expire> <Description></Description> <Location>PID:200</Location> <Flags> <Language>English</Language> <Rating>PG-13</Rating> </Flags> <Price>2.99</Price> <GroupID>002</GroupID> <Dependant>459810</Dependant> </StreamRecord1> </XML>

[0045] Employing framework definition 110, the framework controller 108 is operable to format and organize media components into a stream or streams that may be delivered by the delivery system 112. Delivery system 112 may comprise a headend system and transmission apparatus such as employed by cable, satellite, terrestrial, and other broadcast systems. The organization and establishment of the stream(s) employs parameters provided in framework definition 110. The stream(s) is (are) delivered to receiver 114 that processes the stream(s) and provides output to display device(s) 116, audio output(s) 118, and may send and receive signals to/from user input device(s) 120.

[0046]FIG. 2 depicts components of framework controller 108. Framework controller 108 includes framework control logic 216 that is operable to retrieve and interpret framework definition 110, and employ the parameters thereof to control operation preprocessors 210, 212, and 214, plus packagers 218, 220, and 222 to format and encapsulate information (video, audio and metadata) for multiplexer 226. Video preprocessor 210 and audio preprocessor 212 access media streams or stored data as specified by framework definition 110 and perform processing to prepare the media for the associated packager. Such processing may include rate adaptation, re-encoding, transcoding, format conversions, re-sampling, frame decimation, or other techniques and methods to generate a format and data rate suitable for packagers 218, 220, and 222 as may be specified by framework definition 110. Processing may include MPEG encoding of analog video as may be supported by encoder equipment from Divicom Inc. of Milpitas, Calif., which is a wholly owned subsidiary of C-Cube Microsystems, Inc. When such processing is specified, framework controller 108 may provide a sequence of instructions to the encoder, selecting channels and controlling encoding.

[0047] Metadata preprocessor 214 accesses metadata elements, specified by framework definition 110, and performs processing to prepare these the metadata packager 222. Such processing may include transcoding, format conversions, script conversions, script generation, and image format conversions, for example. In operation, graphical metadata may be sent to metadata preprocessor 214 in a computer graphics format (Photo Shop, for example) that then may be converted to a format that the display platform recognizes (gif for example).

[0048]FIG. 4 is flow chart 400 of the operation of the metadata processor 214. At step 402 an image file is accessed by metadata preprocessor 214. The metadata preprocessor accesses metadata and places it in a predetermined format as may be specified by framework definition 110. Metadata may comprise graphics data, sound data, HTML data, video data, or any other type of data. Metadata preprocessor 214 processes the data and outputs the processed data in a transmission/multiplexer format. As shown in FIG. 4, the flow diagram allows the image file to be converted and output by the metadata processor 214 in real time. At step 404 the graphic file conversion definitions are loaded into the metadata preprocessor 214 to perform the conversion of the image file. At step 406 the image file 402 is converted into the graphic file in accordance with the definitions. At step 408 the metadata preprocessor 214 outputs the converted image file to the metadata packager 220. A high-speed processor may perform these functions in real time using definitions that can be quickly downloaded from a high-speed storage device. Custom designed state machines may also be employed for format conversion. Metadata preprocessor 214 may also be employed to convert the format of HTML metadata. HTML metadata may be sent to metadata processor 214 in HTML 4.0 format that then may be converted to HTML 3.2 format such as may be required by a display platform. The framework controller generates commands that are sentto metadata processor 214 that identify the metadata and specify the output format of the metadata.

[0049] Referring again to FIG. 2, video packager 218 and audio packager 220 are subsystems that package video and audio assets into a format that is compatible with the delivery system 112, including for example, packetization, serialization, rate control, null-packet insertion, and other functions required to prepare the video for compliant transport via MPEG, DVB-C/S/T, PacketVideo, or other transport formats. For example, General Instruments (now owned by Motorola) produces a QAM modulator that modulates MPEG video that is encoded from video preprocessor 210 to digital video broadcast (DVB) format. The framework controller generates commands that are sent to the QAM modulator specifying the frequency (channel) and PID (packet identifier) for the video based upon the framework definition that was provided for the omnimedia package.

[0050] As also shown in FIG. 2, metadata packager 222 is a subsystem that performs packaging on all metadata elements that are to be included in the omnimedia package. Metadata packaging may comprise rate control, packetization, serialization, and synchronization to video and/or audio streams. The metadata is also prepared for transport across any compliant transport mechanism (MPEG, DVB-C/S/T, PacketCable, DVB-MHP, etc.). A commercially available product for performing these functions is the TES3 that is provided by Norpak Corp., Kanata, Ontario. The TES3 encoder encodes metadata into a NTSC signal with NABTS encoding. NABTS is the protocol that allows metadata to be sent in the VBI (vertical blanking interval) of a NTSC signal. The framework controller commands the TES3 encoder as to what lines of the VBI are employed for transmitting metadata.

[0051] Referring again to FIG. 2, omnimenu generator 224 may be implemented as a rules-based subsystem employing framework definition 208 to generate a user-interface that presents program options and allows a viewer to select from these options. Rules may be employed to generate an omnimenu template in HTML page format. The HTML page may comprise a full screen image containing buttons (active icons) that may be selected to activate a particular media stream. Omnimenu generator 224 employs framework definition 208 to identify available streams and associates each stream with a button.

[0052]FIG. 5 is a flow chart 500 illustrating the operation of omnimenu generator 224. At step 502, omnimenu generator 224 accesses the framework definition 110. At step 504, PIDS (program IDs) are extracted from the framework definition. The omnimenu template, described above, is loaded into the omnimenu generator 224 at step 506 of FIG. 5. At step 508, each of the PIDS that have been extracted from the framework definition are assigned to a button such that each of the buttons is labeled with the PID name. At step 510, selection functions are assigned to each of the buttons in accordance with the labels that have been assigned to those buttons. In this manner, the video can be changed in accordance with the labeled functions of each of the buttons of the template. At step 512 the template may be assigned a company logo. In this manner, the content may be properly branded to correspond to the source of that content. At step 514 the omnimenu generator 224 saves or exports the new omnimenu. The onmimenu may be exported to the package multiplexer 226.

[0053] As shown in FIG. 2, package multiplexer 226 combines the package elements into a stream or set of streams, in preparation for transmission. The streams are coupled through delivery system interface(s) 228, to delivery system 112, and the physical and logical interconnects to the transmission system. A Divicom, Inc., (Milpitas, Calif.) MUX is an example of a package multiplexer, having the different frequency and PIDs.

[0054] As indicated in FIG. 1, delivery system 112 then transports the streams to the receiver 114. Delivery system 112 may be analog or digital. In general, system delivery of content is as open as needed for any particular package and topology. A simple movie with no added content may be delivered on a single broadcast multiplex, while a package for the Super Bowl would contain many different delivery mechanisms. Some may be available on a particular receiver and not on others.

[0055] The omnimenu may provide a directory of available media for a broadcast event. The omnimenu is transmitted to a plurality of receivers 114. Receivers 114 may vary in capability and may include upstream (reverse path) communication from the receiver to the headend system, or may use other return systems such as an Internet connection, for example. Receivers 114 that do not include upstream communications may employ the omnimenu to select audio, video and metadata information contained in a broadcast stream. The bandwidth of an analog NTSC channel may be employed to carry several digital video streams, audio streams, and metadata. The omnimenu includes tuning or packet information to identify streams that may be accessed by the receiver. The receiver includes a software program operable to display the omnimenu and operable to tune and render selected streams.

[0056] In another embodiment of the present invention, receiver 114 supports upstream communications. The headend system, in response to upstream communications, may provide on-demand programming and data delivery. The omnimenu initiates upstream communication in response to user selection of a displayed media button (icon). The headend system may supply the requested stream in a broadcast channel, or if already broadcast, may provide tuning or packet decode information to the receiver. The requested stream may be employed to update the framework definition 110. Framework controller 108 uses the framework definition 110 to allocate bandwidth and PID information. The framework controller provides the frequency, PID and bandwidth information from the framework definition and uses it to send/control video and audio packager 218/220. Video packager 218 and audio packager 220 (FIG. 2) then allocate bandwidth for each of the respective video streams.

[0057] As also shown in FIG. 1, information from the framework controller 108 is encoded into the omnimenu so that the receiver will be able to tune to and decode the streams. The framework controller 108 may also include URL's for demand data or streaming media. Locations based on alternate tuner systems may also be included. For example, a radio station frequency having local commentary may be simulcast with the video. Depending of receiver 114 capability, some data and/or programming may be loaded in advance into storage built into the receiver such that the content is available locally for viewing during the airing of the primary content package. For example, all NFL player statistics may be preloaded over a trickle feed before the Super Bowl. In this manner, an interactive fantasy football application may retrieve all needed statistics during the game in order to let the viewer play a fantasy game during the airing of the primary program.

[0058]FIG. 3 depicts a receiver that may be employed in accordance with the present invention. Receiver 302 is used by the system end-user or viewer for use in viewing and controlling the omnimedia package. The receiver 302 may comprise a decoder 304, parser 306, media controller 308, receiver cache 310 and receiver controller 312. Decoder 304 may extract framework information from the delivery stream omnimenu. Information may be encoded in the vertical blanking interval. Decoder 304 may comprise a NABTS VBI decoder from ATI Technologies, Inc., of Thornhill, Ontario. The VBI decoder may extract data from the VBI and present it to framework parser 306. The data may comprise an XML format file of the omnimenu. Parser 306 extracts elements that comprise the framework. Framework parser 306 receives framework data from decoder 304 and prepares the data for use by the receiver controller 312. The receiver controller 312 may comprise a data interpretation module. Media controller 308 selects the media streams (audio, video and/or data) that are described by the framework definition. The media controller comprises 308 a tuner and video/audio decoder. The media controller 308 receives stream data from decoder 304 and control signals from receiver controller 312. Media controller 308 selects the proper media PIDS as needed and feeds them to display device(s) 116, audio device(s) 118, or user input device(s) 120. Receiver controller 312 serves as the central processor for the other elements of receiver 302. The receiver controller 312 may compare the capabilities of receiver 302 with the stream type to determine which streams may be received and used by receiver 302. Receiver controller 312 sends control signals to the other units in the system. The functions of receiver controller 312 may be performed by software resident in the control CPU of the receiver 302. Functions include receiving data from framework parser 306 and signaling media controller 308. For example, selection of omnimenu items may alter the framework such that the receiver controller 312 receives information from framework parser 306 and signals media controller 308 to render the selected media stream. Receiver cache 310 may be employed to store the framework definition and any parameters, code objects, data items or other software to control omnimenu display and response to user input.

[0059] Receiver 302 receives data associated with a content package and employs the data to access the package contents. Advanced receivers may check system capabilities to determine which pieces of content may be rendered. For example, a digital set top box may be able to decode MPEG video and audio. A digital set top box may also be able to decode MP3 audio, but not be able to decode HDTV signals. A radio type receiver may only be able to decode audio formats and, possibly, only formats not related to a video signal. The receiver may also be able to decode various data types from data services. These may take the form of application code that would be executed on the receiver if compatible. The receiver of the present invention has the internal compatibility to be able to receive the omnimedia packaged signal and decode the parts that are relevant to its capabilities to give the fullest experience it can, regardless of the format.

[0060] Referring again to FIG. 1, display device(s) 116 provide(s) the visual presentation of the framework to the end-user/viewer, including video, streaming media, a graphical user interface, static graphics, animated graphics, text, logos and other displayable assets. Similarly, audio output(s) 118 present(s) the audio portions of the framework to the end-user, including primary audio, possible secondary audio tracks, streaming media, audio feedback from the receiver 114, and other audio assets. User input device(s) 120 allow(s) the user to control the receiver and interact with the framework, providing the ability to choose components of the framework, select links within the framework, navigate the graphical user interface of the receiver, or perform other interactions with the receiver.

[0061] The foregoing description of the invention has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed, and other modifications and variations may be possible in light in the above teachings. The embodiment was chosen and described in order to best explain the principles of the invention and its practical application to thereby enable others skilled in the art to best utilize the invention in various embodiments and various modifications as are suited to the particular use contemplated. It is intended that the appended claims be construed to include other alternative embodiments of the invention except insofar as limited by the prior art. 

What is claimed is:
 1. A method for producing a broadcast stream that contains audio content, video content, and metadata content comprising: creating a framework definition that identifies said audio content, said video content and said metadata content associated with a broadcast and attributes thereof; comparing the audio format of said audio content with an audio transmission format and converting said audio content to said audio transmission format if said audio format and said audio transmission format differ; comparing the video format of said video content with a video transmission format and converting said video content to said video transmission format if said video format and said video transmission format differ; comparing the metadata format of said metadata content with a metadata transmission format and converting said metadata content to said metadata transmission format if said metadata format and said metadata transmission format differ; creating a menu describing said audio content, said video content, and said metadata content; combining said audio content, said video content, and said metadata content into a broadcast stream; transmitting said menu; and transmitting said broadcast stream.
 2. The method of claim 1 wherein said framework definition further comprises: a framework definition record for each element of said audio content, said video content, and said metadata content.
 3. The method of claim 1 wherein said menu further comprises: an icon for each element of said audio content, said video content, and said metadata content.
 4. The method of claim 1 wherein said metadata content comprises an image file.
 5. The method of claim 5 wherein said converting said metadata content further comprises: loading said image file; loading a file conversion definition; converting said file using said conversion definition; and outputting a converted image file.
 6. A method for rendering portions of a broadcast stream that contains audio content, video content, and metadata content and a menu indicating the contents of said audio content, video content, and metadata content comprising: transferring preloaded metadata associated with said broadcast stream to a receiver prior to transmission of said broadcast stream; receiving said broadcast stream; displaying said menu wherein said menu includes an icon representing said preloaded metadata; receiving a user input; and rendering said preloaded metadata in response to said user input.
 7. A system for combining multiple media and metadata streams having content into a framework for distribution of the content to a viewer, comprising: at least one video source having an output; at least one audio source having an output; at least one metadata source having an output; a framework controller that receives said video source, audio source, and metadata source and produces an omnimedia package integrating said outputs into a framework; a framework definition module that interfaces with said framework controller and defines all content to be used in the omnimedia package; a delivery module that receives said omnimedia package from said framework controller and transmits said omnimedia package to a receiver; and a receiver that receives and distributes the content of said omnimedia package to display devices and audio outputs, said receiver further coupled to at least one user input device for providing interactivity between said viewer and the receiver. 